Using linear interpolation to improve histogram equalization for speech recognition

نویسندگان

  • Filip Korkmazsky
  • Dominique Fohr
  • Irina Illina
چکیده

This paper presents a novel approach to speech data normalization by introducing interpolation for histogram equalization. We study different ways of histogram interpolation that inhence this normalization technique. We found that using a special weighting factor to combine current and past test sentence statistics improved speech recognition performance. For the testing that used weighted histogram interpolation we achieved 44.85% phone error rate against 49.42% phone error rate for the testing without normalization and 48.59% phone error rate, when only a single test sentence histogram was used for normalization. Recognition experiments were conducted on speech data recorded in a moving car and proved advantage of using interpolation for data normalization by histogram equalization.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Histogram Equalization to Model Adaptation for Robust Speech Recognition

We propose a new model adaptation method based on the histogram equalization technique for providing robustness in noisy environments. The trained acoustic mean models of a speech recognizer are adapted into environmentally matched conditions by using the histogram equalization algorithm on a single utterance basis. For more robust speech recognition in the heavily noisy conditions, trained aco...

متن کامل

Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...

متن کامل

Robust Spelling and Digit Recognition in the Car: Switching Models and Their Like

Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise in the interior of a car. We aim to improve noise robustness focusing on all major levels of speech recognition: feature extraction, feature enhancement, and speech modeling. Different auditory modeling concepts, speech enhancement techniques, training strategies, and model ar...

متن کامل

Histogram Equalization Based Front-end Processing for Noisy Speech Recognition

In this paper, we present Gabor features extraction based on front-end processing using histogram equalization for noisy speech recognition. The proposed features named as Histogram Equalization of Gabor Bark Spectrum features, HeqGBS features are extracted using 2-D Gabor processing followed by a histogram equalization step from spectro-temporal representation of Bark spectrum of speech signal...

متن کامل

A new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering

In this paper, a new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering is proposed. The progressive histogram equalization (PHEQ) performs the histogram equalization (HEQ) progressively with respect to a reference interval which moves with the present frame to be processed. The multi-eigenvector temporal ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004